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We show that a large collection of statistical mechanical systems 
with quadratically represented Hamiltonians on the complete graph 
can be extended to infinite exchangeable processes. This extends a 
known result for the ferromagnetic Curie-Weiss Ising model and in¬ 
cludes as well all ferromagnetic Curie-Weiss Potts and Curie-Weiss 
Heisenberg models. By de Finetti’s theorem, this is equivalent to 
showing that these probability measures can be expressed as aver¬ 
ages of product measures. We provide examples showing that “fer¬ 
romagnetism” is not however in itself sufficient and also study in 
some detail the Curie-Weiss Ising model with an additional 3-body 
interaction. Finally, we study the question of how much the antiferro¬ 
magnetic Curie-Weiss Ising model can be extended. In this direction, 
we obtain sharp asymptotic results via a solution to a new moment 
problem. We also obtain a “formula” for the extension which is valid 
in many cases. 

1. Introduction. Let X = {Xi,... ,Xn) be a finite exchangeable collec¬ 
tion of random variables taking values in a space S which is assumed to 
be a closed subset of W, often a finite set or the {s — l)-dimensional unit 
sphere. (Finite exchangeable means that the distribution is invariant under 
all permutations of {l,...,n}.) In such a situation, one can ask whether 
X is extendible to an infinite exchangeable process. In other words, does 
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there exist a process Y = (Yi)i>i, taking values in S, whose distribution is 
invariant under finite permutations and such that X and (Yi,..., Yi) have 
the same distribution? (In this case, the process Y will often not be unique.) 
If the distribution of X is “an average of product measures,” meaning that 
the distribution of X can be expressed as 

Jpis) 

where P{S) is the set of probability measures on S, is the n-fold prod¬ 
uct of p and p is a probability measure on P{S) (endowed with an appro¬ 
priate cr-algebra), then it is immediate that X is extendible to an infinite 
exchangeable process. One such infinite exchangeable process of course has 
distribution 


( 1 . 1 ) 



dp{p). 


The important de Finetti theorem says that any infinite exchangeable pro¬ 
cess (when 5 is a complete separable metric space) can be expressed as in 
(1.1) for a unique p. Hence X is extendible to an infinite exchangeable pro¬ 
cess if and only if it is “an average of product measures.” When a finite 
exchangeable process is extendible to an infinite exchangeable process, we 
will write it is infinitely extendible (IE). When S = {0,1}, P{S) can be iden¬ 
tified with [0,1] and a probability measure on P{S) can be identified with a 
random variable W taking values in [0,1]. The kih. moment, E\W^], is then 
the probability that the first k random variables are 1. If one can extend a 
finite exchangeable sequence X = (Xi,..., Xn) to an infinite one, there may 
be more than one extension and so the distribution of W is not unique. See 
[13] for some discussion concerning this point. Any W which can be used 
will be called a representing W for X. 

Surveys of exchangeability can be found in [2] and [4]. The problem of 
determining when finite exchangeable sequences can be extended to infinite 
exchangeable ones has attracted some attention in the past; see the two 
above references as well as [21] and [23]. In the present paper, we will study 
this question in the mean field statistical mechanics context and also study 
how much one can extend if IE fails. 

We start by stating a known result where jS*] =2. Consider the Curie- 
Weiss Ising model with parameters J and h representing the coupling con¬ 
stant and the external field. This is simply the Ising model on the complete 
graph with symmetric 1 and 2 body interactions. See [6] or [19] for an in- 
depth discussion of this model. This is the probability measure on {Tl}” 
where the probability of the configuration a is proportional to where 

n T / ^ \ ^ 


( 1 . 2 ) 
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The model is said to be ferromagnetic if J > 0. The following result is proved 
in [20] where the technique is credited to [11]. 

Theorem 1.1. For n> I, J > 0 and any h, the Curie-Weiss Ising 
model with parameters n, J and h is IE. 

Remarks. When [S'! = 2, a necessary condition for being IE is that the 
process be associated (i.e., increasing events are positively correlated, see 
Definition 2.11 on page 77 of [15]); this follows from an obvious generalization 
of Proposition 2.22 on page 83 of [15]. In fact, it is not hard to check that IE 
even implies the stronger EKG lattice condition (see (2.13) on page 78 of [15] 
for this definition). Using this, it is easy to check that for n > 2 and J < 0, 
the model is not IE. We point out however that an elementary example 
in [16] shows that this EKG lattice condition plus finite exchangeability is 
not sufficient for being IE. An alternative way to see that IE implies J > 0 
is as follows. We have IE if and only if there is a random variable 0 < IT < 1 
so that 

h{2k—n)+(J/2)(2k—n)^ 

(1.3) EW'^il-W)^-’^ = —^ - 0<k<n. 

Holder’s inequality then gives 

EW^{1 - W)^~^ < (E1U”)^/”(E(1 - 

from which it is easy to deduce that J > 0. 

The following result extends Theorem 1.1 to a large number of models 
which have a quadratic representation for their Hamiltonian. It is proved 
using a similar method to that in [20]. Let n be an arbitrary probability 
measure on satisfying 

(1.4) [ e'’'^ dv{y) < oo 

Jr-’ 

for all V € R^. Next assume that n and n are such that 

(1.5) Z^^n-= [ ... ,Xn) < oo. 

Then we can consider the probability measure on (R^)'^ which is 
absolutely continuous with respect to with Radon-Nikodym derivative 
at {xi,...,Xn) given by 
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Theorem 1.2. For any v and n satisfying (1.4) and (1.5), Hu,n, viewed 
as the distribution of a finite exchangeable collection of n random variables 
taking values in W, is IE. 

Although the above formulation is very simple, many models (including 
the Curie-Weiss Ising model) fall into this category as we now briefly dis¬ 
cuss. 

• Curie-Weiss Potts model: 

Let q be an integer larger than 1, S' = {1,...,g}, J £ R and h:S ^ R. The 
Curie-Weiss Potts model with parameters n, J and h is the probability 
measure on S” where the probability of the configuration a is proportional 
to e^^^\ where 



The model is said to be ferromagnetic if J > 0. (Note q = 2 is equivalent to 
the Curie-Weiss Ising model.) 

• Heisenberg model: 

Let r be a nonnegative integer, S the r-dimensional sphere, J £ R and 
h'.S^R. Letting dx denote “surface area” on S, the (classical) Heisen¬ 
berg model with parameters n, J and h is the probability measure on S” 
whose Radon-Nikodym derivative with respect to dx®^ at a = (cJi,... ,an) 
is proportional to , where 



The model is said to be ferromagnetic if J > 0. (Note r = 0 is equivalent to 
the Curie-Weiss Ising model.) 

• Curie-Weiss clock model (see [8]): 

Let q be an integer larger than 1, S' be g points on the unit circle with 
constant spacing, J £ R and h: S ^ R. The Curie-Weiss clock model with 
parameters n, J and h is the probability measure on S” where the proba¬ 
bility of the configuration a is proportional to , where 



The model is said to be ferromagnetic if J > 0. (Note again q = 2 is equiva¬ 
lent to the Curie-Weiss Ising model.) 

Corollary 1.3. For any n, J >0 and h:S —> i?, the Curie-Weiss 
Potts model, the Curie-Weiss Heisenberg model and the Curie-Weiss clock 
model are IE. 
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Remarks. The fuzzy Potts model is obtained from the Potts model de¬ 
fined above by partitioning the possible spins into two sets. Haggstrom [10] 
proved that the ferromagnetic fuzzy Potts model (with no external field) 
on any graph has positive correlations and in fact satisfies the FKG lattice 
condition. We point out that for the special case of the Curie-Weiss Potts 
model (i.e., on the complete graph), this follows from Corollary 1.3 by using 
(i) the trivial fact that such a “projection” of an IE system is IE and (ii) IE 
systems with [S'] = 2 satisfy the EKG lattice condition. This same argument 
shows that if we take the ferromagnetic Heisenberg model on the complete 
graph on n vertices and partition the sphere into two arbitrary measurable 
sets, then the induced measure on {0,1}” satishes the FKG lattice condition; 
this does not appear to be obvious directly. 

It seems reasonable to ask whether all “ferromagnetic” systems on com¬ 
plete graphs are IE. One problem with this is that it is not clear exactly 
which systems should be considered ferromagnetic. We first consider the case 
when [S’] = 2 but where we add 3-body interactions. Consider the probabil¬ 
ity measure ^ih,J 2 ,Jz,n on {± 1 }” where the probability of the configuration 
a is proportional to , where 

( 1 . 6 ) H{a) = h±a.+ ^M±a}\\M±a}\\ 

2=1 \ 2=1 / \ 2 = 1 / 

This is the Curie-Weiss Ising model with an additional 3-body interaction 
term. The system is ferromagnetic (as defined in Chapter 4 of [15]) if and 
only if J 2 , J 3 > 0. However, if n = 3, for example, and h and J 2 are fixed, 
then for J 3 sufficiently large, we have 

P{Xi = 11 X 2 = -1,^3 = -1) > P{Xi = 11 X 2 = -1,X3 = 1). 

This implies that the FKG lattice condition fails and so (Xi,X 2 ,X 3 ) is 
not IE. A similar argument works for any fixed n > 3 or for J 3 sufficiently 
negative and also shows that for any n > 3, if J 2 = 0 and J 3 7 ^ 0, then IE 
fails. 

We now restrict to only 2-body interactions but general S. For 2-body 
interactions, one might define ferromagnetic to mean that the 2 -body inter¬ 
actions are of the form f{x ■ y) where / is an increasing function. However, 
it turns out that a system which has only 2 -body interactions of this form 
need not be IE. 

Proposition 1.4. For every n, there is an increasing function f so 
that if S = {{1,0), (0,1), (—1,0), (0,-1)} or 5 = {—1,0,1}, then the finite 
exchangeable probability measure on given by the Hamiltonian 

( 1 - 7 ) H{a)= fin-CTj) 

l<i<j<n 


is not IE. 
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Remarks, (i) The first S shows that we can take the spin values to have 
length 1 while the second S shows that we can take the spin values to be a 
subset of R. 

(ii) It would be of interest to investigate whether IE would follow if one 
assumed that / had some higher-order monotonicity. 

For the rest of the results, we continue to restrict to [S'! = 2. Unfortunately, 
we need to break the class of finite sequences (Xi,..., X^) which are IE into 
two classes. 

Definition 1.5. If {Xi,... ,X„) is IE, we call it type 1 if there exists a 
random variable W which represents {Xi,... ,Xn) and is in (0,1) a.s. It is 
called type 2 otherwise [i.e., if every representing W satisfies P{W G {0,1}) > 
0 ]. 


Remarks. A trivial example of type 2 is where P{Xi ^ X 2 ) = 0. In this 
case, a representing W is trivially unique and has P{W G {0,1}) = 1. One 
can check that for n = 2 this is the only type 2 situation. However, for n = 3, 
we have a less trivial example of type 2 where {Xi,X 2 ,X 3 ) is represented 
by a IT satisfying P(W = 0) = P{W = 1/2) = 1/2. In this case, if V were 
another representing random variable (meaning that the first 3 moments are 
the same as those for W), then one can check that — (1/2)U^/^)^] = 

0. This implies that — (1/2)) = 0 a.s. which forces V to have the 

same distribution as W. It is also trivial to find type 1 A’s which have a 
representing W satisfying P{W G {0,1}) > 0. 

Concerning the problem of determining whether a given finite exchange¬ 
able process is type 1 IE, we mention the following characterization which 
will be used in the proofs of Propositions 1.7, 1.10 and 1.12. 

Proposition 1.6. Let = {Ai = ■ ■ ■ = X^ = l,Xk+i = 0,..., = 0}. 

Then {Xi,... ,Xn) is IE of type 1 if and only if there exists a random variable 
f and c> 0, so that 

cP{Ek) = 

for k = 0,... ,n. 

Remark. We observe that the latter is also a type of moment problem, 
since this condition is the statement that P{E]f) is the kih. moment of 
where C’s distribution has Radon-Nikodym derivative with respect to 
the distribution of 

Using Proposition 1.6, we will obtain the following result which provides 
some further information concerning the Curie-Weiss Ising model with an 
additional 3-body interaction term. 
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Proposition 1.7. Consider the probability measure Hh,J 2 ,J 3 ,n correspond¬ 
ing to the Hamiltonian given in (1.6). For all /i, J 2 , J 3 with J 3 7 ^ 0 , there 
exists N sueh that for all n> N, pLh,J 2 ,J 3 ,n is not IE. 

This should be contrasted with the fact that for all /i, J 2 > 0 and n, there 
exists e > 0 so that for | J 3 I < e, yLh,J 2 ,J 3 ,n is IE; this follows readily from the 
alternative proof of Theorem 1.1 together with continuity. 

Proposition 1.7 might be viewed as unnatural for the following reason. 
As n increases, it is not so physically natural to keep the coefficients J 2/2 
and J 3/6 fixed but rather they perhaps should decrease with n and the 
appropriate Hamiltonian would be 

( 1 . 8 ) 

We do not know the answer to the following question. 

Question 1.8. Is it the case that for any h, J 2 and J 3 / 0, for all large 
n, the model using the Hamiltonian (1.8) is not IE? 

We have however the following two results related to this question where 
we take h = 0 for simplicity. (Proposition 1.12 below tells us that taking 
/i = 0 is no restriction.) 

Proposition 1.9. Consider the probability measure pij 2 ,j 3 ,n eorrespond- 
ing to the Hamiltonian given in (1.8) with h = 0. // | J 3 I > J 2 , then there 
exists N sueh that for all n> N, itjj, J 3 ,n is not IE. 

Proposition 1.10. Civen J 2 and J 3 7 ^ 0 , for only finitely many even 
values of n can the system on the complete graph on n vertices with Hamil¬ 
tonian 

(!■») 

be IE. 

To see in another way the degree to which the 3-body interaction term 
hinders being IE, we look at n = 4. In this case, one can check that when 
the Hamiltonian is taken to be 
4 

h 'y T J 2 ^ ( CTiCTj + ^ ( O'iO'jO'}- 

i=l l<i<j<k<A 
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the system is IE if and only if J 2 > 0 and 

cosh( 8 J 3 ) < cosh(4J2) — 2 e“^‘^^(sinh( 2 J 2 ))^. 

This latter condition involving the hyperbolic functions comes from con¬ 
sideration of the moment characterization (2.2) of IE. Erom this one can 
conclude after some computation that if J 2 and J 3 both approach 0 with 
JI/J 3 approaching c, then if c > 1/2, the system is eventually IE while if 
c < 1/2, the system is eventually not IE. Note the difference in the exponents 
(3 versus 2). 

The earlier Proposition 1.6 has other applications as well. 

Definition 1.11. If /r is a finite exchangeable probability measure on 
{±1}”, we let Tj/i(/r) be the probability measure on {±1}” which gives a 
configuration a probability proportional to where 

n 7 / ^ \ ^ 

Of course, if /r is uniform distribution, then Tj^h{fj,) is just the Curie-Weiss 
Ising model. One might call Tj^h(fi) a “(J,/i)-Isingization” of /i. 

Proposition 1.12. If the probability measure ja on {±1}” is IE then 
for all J >0 and h G R, Tj^hih) o,lso IE. 

Remark. This tells us that when studying the question of which models 
are IE, we can assume that there is no external field. 

Given a finite exchangeable sequence X = (2fi,..., Xn) which is not IE, it 
is interesting to ask if it can be extended to a finite but longer exchangeable 
sequence. 

Definition 1.13. Eor I > n, a finite exchangeable sequence X = 
(Xi,... ,Xn) is Eextendible if there exists a finite exchangeable process 
Y = {Yi)i<i<i such that X and {Yi,... ,Yn) have the same distribution. 
We let E{n,l) denote the collection of finite exchangeable processes X = 
{Xi, ..., Xn) which are Eextendible. 

Remarks. It is not hard to show that X = (Xi,... ,Xn) is IE if and only 
if it is Eextendible for every I > n. When S is compact, this follows from an 
elementary compactness argument. If S is not compact, a similar “tightness” 
argument can be easily carried out. In view of this fact, to prove the Curie- 
Weiss Ising model is extendible to an infinite exchangeable process, it would 
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suffice (assuming for this discussion that h = 0) to show that for any n and 
J and any I > n, there exists J' = J'{n,J,l) so that the projection of the 
Curie-Weiss Ising model on the complete graph of size I with parameter J' 
to n vertices is the Curie-Weiss Ising model on the complete graph of size n 
with parameter J. However, this is typically not true (it is however true for 
small n) and hence this approach to proving Theorem 1.1 does not work. 

Proposition 1.12 says that for J > 0 and any h, leaves E{n,oo) in¬ 
variant; the following is an interesting complement to this which says that 
this is false for finite I > n even when J = 0. 

Proposition 1.14. Given 2 < n < I < oo, there is a fi £ E{n,l) with full 
support and h£ R such that ^ E{n,l). 

Remark. In contrast to the remark after Proposition 1.12, when we are 
asking about finite extensions, we cannot assume that there is no external 
field. 


We finally consider the Curie-Weiss Ising model with J < 0; this is the 
antiferromagnetic case. With the exception of Proposition 1.20, the rest of 
the results concern only the antiferromagnetic Curie-Weiss Ising model with 
parameters J < 0, /i and n as defined in (1.2). As observed following the 
statement of Theorem 1.1, the Curie-Weiss Ising model in not IE in this case. 
The following result gives us some very precise information concerning how 
far one can extend the model when J is very close to 0. It will be convenient 
for our purposes to use a different parameterization for the Curie-Weiss Ising 
model. Given parameters a > 0,6 > 0, we let the Gibbs state /r on {0, !}"■ 
have probabilities of the form p,{r]} = /sn, where k is the number 

of I’s in the conhguration jy and the normalizing constant is given by 





gfl)Hn-k) _ 


An easy computation shows that our two parameterizations are related by 
a = b = e~‘^^ and so 6 < 1 corresponds to J > 0. The following gives 
conditions under which it is Textendible for large 1. Since we will be letting 
6^1 and the Curie-Weiss Ising model with 6 = 1 is the product measure 
with density a/(l +a), it is natural to denote p:=a/(l + o). 


Theorem 1.15. Let n > 2. Let p be as above and consider the Curie- 
Weiss Ising model on {0,1}"' with n and a fixed, and let b=l + {c/l). 

(a) If c < l/(2/?(l — p)), then the Curie-Weiss Ising model with parame¬ 
ters a and b on {0,1}” is I-extendible for all sufficiently large 1. 
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(b) If c> l/{2p{l — p)), then the Curie-Weiss Ising model with parame¬ 
ters a and b on {0,1}” is not l-extendihle for all sufficiently large 1. 


Remarks. Regarding the remark after Proposition 1.14, we see here that 
the external field is indeed relevant to the finite extension problem and is 
even relevant for the I —> oo asymptotics. Note also the monotonicity in a in 
the above result. This monotonicity however only holds in the asymptotics. 
Looking at the cases n = 2,3 and I = 4 discussed at the beginning of Section 
4, one sees that such monotonicity does not hold for finite n and 1. In fact, 
for n = 4, ^ = 5 and certain values of a, the set of Rs for which one can extend 
is not even an interval. We mention here that the argument involving (b) is 
considerably simpler than that for (a) since this part comes down to showing 
the nonexistence of a particular random variable by demonstrating that its 
first and second moments would not satisfy the Cauchy-Schwarz inequality. 

Further remarks. One might reasonably ask how big I needs to be in part 
(a) of Theorem 1.15. The answer undoubtedly depends in a complicated way 
on n, c and p. For small n, one can say something about this using the criteria 
given in Section 4. If n = 2 and 2p{l — p)c < 1 then the model is Textendible 
for all I >2. If n = 3, the answer is the same if 2p{l — p)c < 1 and p is close 
to 1/2. However, if one takes pfl and c t oo with 2p(l — p)c = d G (0, 1), 
then the condition for /-extendibility is asymptotically 


l> 


1 [ 

— - 7 max 1, 

2(1-P) 


d^ 1 
1-d 


One of the key steps in the analysis of the above is a solution to a new 
discrete finite moment problem. We first recall that determining whether an 
infinite sequence of numbers in [0,1] is the sequence of moments of some ran¬ 
dom variable taking values in [0,1] is called the Hausdorff moment problem 
for which a well-known sufficient and necessary condition is known. Con¬ 
ditions are also known which insure that a finite sequence of numbers in 
[0,1] can be extended to a moment sequence. This will be used to obtain 
an alternative proof of Theorem 1.1. The following theorem is a solution to 
a discrete moment problem. It is the key to proving Theorem 1.15 and we 
believe it to be of independent interest. 


Theorem 1.16. Given vi,... ,Vn, there exists a ... ,1} -valued ran¬ 
dom variable N satisfying 

(1.10) Vk=EN^ fork = l,...,n 

if and only if cq -|- CiVi > 0 for all polynomials P{x) = J22=o of de¬ 
gree n that have n simple roots in {0 ,... ,1} and are nonnegative on {0,..., Z}. 
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Remarks. For the case I = oo, such problems have been studied; see Chap¬ 
ter VII in [14]. In this sense, the above result is not such a large departure 
from known results. However, the technical result we use to verify the con¬ 
dition of Theorem 1.16 (which is contained in Theorem 4.3) is significantly 
different from that which appears in the treatment of earlier moment prob¬ 
lems. 

The proof of Theorem 1.15 is an existence proof and does not give a 
“formula” for the distribution of the extension. In Section 5, we will, for each 
n, I, J < 0 and h give a formula for the distribution of the number of I’s in 
the extension. However, this distribution might be a signed measure in which 
case the formula is of course not valid. Nonetheless, if it is a distribution, then 
the formula will be correct. The motivation for this approach comes from 
trying to extend the first proof of Theorem 1.1 (either the one from [20] or 
the one that comes out of the proof of Theorem 1.2) to the antiferromagnetic 
situation. 

Note that an exchangeable measure on {0,1}"' can be identified with a 
probability measure on {0 ,..., nj which gives the distribution of the number 
of I’s. If {Yi ,..., Ym') is an extension of (Vi,..., Xm), both exchangeable, 
with corresponding distributions ^m' on {0,..., M'j and hm on {0,..., Mj, 
it is trivial to check that 

/M\ 

( \ \m)\m'—m) / /\ 

). 

m' \m') 

In this case, we say that hm is the hypergeometric projection of //m' ■ 

If J > 0 and fj is the density function for a normal random variable with 
mean 0 and variance J, it is easy to check that 

/ OO 

(e(“+^)+e-(“+^))"/j(x) dx 

-OO 

is the normalization for the Curie-Weiss Ising model on n vertices with pa¬ 
rameters —J and h. In the next result, for M > 0, (~J|[^) := 


Proposition 1.17. 


(1.11) Q{j):= 


Z„ 


Fix n, J > 0, /i > 0 and I > n. Let 

^ ^ f -{I-n)\ -{J/2)(2m+2l-n-2j+h/J)^ 

V j/ ^ \ / 

^ m=0 ^ ^ 


for j £ {0,... ,1}. Then 

( 1 . 12 ) 




0<A:<n 


is the probability measure on { 0 ,...,n} corresponding to the Curie-Weiss 
Ising model with parameters n, — J and h. [This implies that ^jQ(j) = 1.] 
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Hence if Q{j) >0 for each j, then the finite extension exists and Q provides 
a formula for the extension. 

Remarks. We point out here that we know that there are cases, including 
when h = 0, where the extension exists but where Q{j) <0 for some j and 
hence the formula is invalid. On the other hand, if Q{j) > 0 for all j, then it 
is easy to see, using the fact that the hypergeometric projection is a linear 
mapping of full rank, that there are many other /-extensions besides that 
given by Q. (Proposition 1.20 is proved along these lines.) 

The above theorem will help lead to the following two results and also 
gives us our first cases with n < I where Q > 0 . 

Proposition 1.18. Fixn, I and h. Then the Curie-Weiss Ising model 
with parameters J and h on { — 1,1}” is l-extendible for all J if and only if 
n is odd, I = n + l and h = 0. Moreover, in this latter case, we always have 
Q >0 and so provides a formula for the extension. 

Remark. The following suggests why one might have expected part of 
the result stated in Proposition 1.18. If we take /i = 0 and J = —oo, then 
the Curie-Weiss Ising model corresponds to uniform distribution on subsets 
of “half” the vertices. It is clear that the projection of this distribution on 
even n to n — I is simply this distribution on n — I while it is easy to see 
that this distribution on even n cannot come from any distribution on n -|- 1 . 

Proposition 1.19. For all n, there exists e > 0 such that for all J 
with \J\ < £, the Curie-Weiss Ising model with parameters h = 0 and J on 
{— 1 , 1 }"" is n + l-extendible and Q >11 and so provides a formula for the 
extension. 

Remark. The key part of the above result is that this is another case 
in which (1.11) yields a formula for the extension. The weaker fact that the 
system is n -|- l-extendible for small J follows from a much more general 
result. This is stated in the following easy proposition. 

Proposition 1.20. Fix n < I < oo and let X = {Xi,... ,Xn) be i.i.d. 
0,1 valued with P{Xi = 1) = p G (0,1). Then any small perturbation of X 
which is in E{n,n) also lies in E{n,l). This is false ifp = 0 or 1. 

Our final theorem gives us a very large number of cases where our for¬ 
mula is valid. These cases are qualitatively similar to those covered in The¬ 
orem 1.15 but quantitatively different. 
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Theorem 1.21. Let n>l and c > 0. Define 

1 


a{c) := c + cosh 


-1 


— ctanh 


cosh 


-1 


1 


(l_e-i/c)i/2;j 


(l_e-i/c)i/2 

/3(c) := ln(-v/c + Vc — 1) + c — (? — c {defined only for c > 1) 

and 

{ max{c, Q!(c)}, if cG {0,1], 

max{Q:(c),/3(c)}, //cG (1,3/2), 

/3(c), ifc>3f2. 


Then, for all c> 0 and h > h*{c), if J = c/l, I is sufficiently large and Q is 
defined as in (1.11), we have that Q >0 and so provides a formula for the 
relevant extension. [Recall the hypergeometric projection of Q is the Curie- 
Weiss Ising model with parameters n,—J and h as defined in (1.2).] 


Remarks. The parameterizations here and in Theorem 1.15 are different 
and it is easy to check that a value of c in Theorem 1.15 corresponds to 2c in 
the above result. In Theorem 1.15, it is easily checked that c < l/(2p(l — p)) 
provided that (i) c < 2 (no matter what h is) or (ii) c > 2 and \h\ > ln( 1 / 0/2 + 
\/c/2 — 1). Hence, in view of the above relationship between the c’s in the 
two results and the fact that limc^oo |/3(c) — ln(-/c + -/c^n))| = 1/2, we 
have that for large c (equivalently large h), the two conditions are not too 
far apart. We are not sure whether to believe that in the asymptotic regime 
where Theorem 1.15 guarantees an extension we also have that Q >0. At 
the same time, we are quite sure that the bounds given in this theorem are 
not sharp. In this theorem, it is also possible to take n growing to 00 as long 
as n = o{\tl) but we do not bother to elaborate on this. 

The rest of the paper is organized as follows. In Section 2, we first give 
proofs of Theorem 1.2 as well as Corollary 1.3. We then give an alterna¬ 
tive proof of Theorem 1.1. For this alternative proof, we will use a known 
sufficient condition for when a given finite sequence extends to an infinite 
moment sequence. Often such conditions, while being of theoretic interest, 
are difficult to apply; we find it interesting that such a condition can be 
checked in this concrete situation. We point out that the alternative proof 
of Theorem 1.1 is not as unrelated to the proof of Theorem 1.2 (in the case 
of the Ising model) as may at first seem. In the first proof, one explicitly 
uses the fact that e* is a moment generating function. Had we not known 
this, we would have to check it by verifying certain conditions, which would 
probably involve positivity of certain determinants which is the approach of 
the alternative proof. In addition, this alternative proof is the basis of the 
proof of Theorem 1.15. Finally, we give the proof of Proposition 1.4 in this 
section. In Section 3, we will give the proofs of Propositions 1.6, 1.7, 1.9, 
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1.10, 1.12 and 1.14. In Section 4, the proofs of Theorems 1.15 and 1.16 will 
be given. Finally, in Section 5, we prove Propositions 1.17, 1.18, 1.19 and 
1.20 as well as Theorem 1.21. 

2. Hamiltonians with qnadratic representation are IE. We first prove 
Theorem 1.2. 


Proof of Theorem 1.2. Let P be a random variable in whose 
density (with respect to s-dimensional Lebesgue measure) is given by 

, , (/^.e--dKx))"e-ll-llV 2 

^ (2vr)P2 ’ 

where is given in (1.5). (It is easy to check that is the correct 
normalization to yield a probability density.) For w G R^, let Pu, be the 
probability measure on R^ which is absolutely continuous with respect to ly 
with Radon-Nikodym derivative 

dP^jx) 

di^{x) ' e'^-y du{y)' 

[Note the denominator is hnite by (1.4).] Letting Fy be the distribution of 
V, we claim that 


f^u,n= [ PTdPviw) 
Jr‘ 


which proves the result. To see this, note that the right-hand side clearly is 
absolutely continuous with respect to with Radon-Nikodym derivative 
at {xi,...,Xn) given by 


nr=i 


oW-Xi 


Ir- Urs e^-ydi^iy))' 


■ dPy (w) = 


^i/,n(2vr)®/2 Jrs 


E n 


dw 


which, using the formula for the moment generating function for a multidi¬ 
mensional standard normal random variable, is 




,( 1 / 2)11 




z, 




as desired. □ 


Proof of Corollary 1.3. We assume throughout J > 0 as otherwise 
the results are trivial. 

Curie-Weiss Potts model: Let the parameters g, J > 0 and h be given. Let 
s = q, choose q orthogonal vectors Q = {ui,..., ag} on the sphere {x: ||a:|| = 
\/j} in Ry and let be the probability measure on R^ concentrated on Q 
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giving Oj weight proportional to It is easy to check that the corre¬ 

sponding measure fiu,n is exactly the Curie-Weiss Potts model where a* is 
identified with i. 

Heisenberg model: Let the parameters r, J > 0 and h be given. Let 
s = r + 1 , consider the sphere {x: ||x|| = \/j} in and let v be the proba¬ 
bility measure on concentrated on this sphere whose density with respect 
to surface area is proportional to _ it is easy to check that the cor¬ 

responding measure is exactly the Heisenberg model with the sphere 
{x: ||x|| = y/J} trivially identified with the unit sphere. 

Curie-Weiss clock model: Let the parameters g, J > 0 and h be given. 
Let s = 2 and choose q equally spaced points Q = {ai,..., aq} on the circle 
{x: ||x|| = y/j}. Let v be the probability measure on R^ concentrated on 
Q giving Oj weight proportional to It is easy to check that the corre¬ 
sponding measure is exactly the Curie-Weiss clock model where a* is 
identified with i. □ 

We now give an alternative proof of Theorem 1.1. For this proof, we prefer 
to use the second parameterization given right after Proposition 1.14. 

Alternative proof of Theorem 1.1. Define sequences and 
for 0 < A: < n by 

and 



Then (1.3) is equivalent to 

(2.1) EW^ = Vk, 0<k<n. 

Thus our problem is reduced to determining whether vq,. .. ,Vn can be ex¬ 
tended to the sequence of moments of a random variable 0 < kF <1. There 
is a classical solution to this problem. See [22] or [24] for a more recent dis¬ 
cussion. The condition for extendibility is a bit different depending on the 
parity of n, so we assume from now on that n is even, and write n = 2m. 
The odd case is similar. 

A sufficient condition for the existence of 0 < VF < 1 satisfying (2.1) is 
that the following two matrices be (strictly) positive definite: 


/ Vo 

Vl 

Vm \ 


/ Wl 

W2 

Wm \ 

Vl 

V2 


and 

W2 

W3 

VJm+1 

\Vm 

Vm+l 

Vn / 


Kwm 

VJm+l 

■ Wn-1 / 
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where Wk = Vk — Vk+i- We will consider only the first of these, since the 
treatment of the second is similar. 

A matrix is positive definite if and only if its principal minors are all 
strictly positive. This is a standard result in linear algebra. It is usually 
proved by induction. However, in Proposition 4.2, we will have a result that 
yields an immediate proof of this fact. Usually, one thinks of the principal 
minors as being the determinants of the submatrices that are situated in the 
upper left corner of the matrix. However, it is much more convenient for our 
purpose to consider the ones that are situated in the lower right corner of 
the matrix. We will use the following notation: 


j=0 


ui-j, 


0<k<l<n. 


Thus Vn-k = f{k,n). With this notation, we must prove the strict positivity 
of 


(2.3) 


'^n—2k 

'^n—k 


f{2k,n) • 

•• f{k,n) 

'^n—k 

Vn 


f{k,n) • 

•• /(0,n) 


k = 0,... ,m. 


We begin by performing some row operations. We will use repeatedly the 
easily verified relation f{k, 1) — f{k — l,l) = f{k — 1, / — 1). Now, subtract the 
second row from the first, then the third from the second, ..., and finally 
the last row from the kth row. The result is that (2.3) equals 

f{2k-l,n-l) f{2k-2,n-l) ••• /(A:-l,re-l) 

f{k,n-l) f{k-l,n-l) ••• /(0,n-l) 

f{k,n) f{k-l,n) ••• /(0 ,n) 

Repeat this process a number of times, each time using one less row than 
the previous time. The result is that (2.3) equals 

f{k,n-k) ••• f{0,n-k) 


f{k,n) 


/(0,n) 


Finally, repeat this whole procedure using columns instead of rows. The 
result is that (2.3) equals 


(2.4) 


Note that up to this point, we have not used the particular form of the 
Ufc’s. The statement that the left-hand side of (2.3) equals the right-hand 
side of (2.4) holds for any finite exchangeable measure. 


/(0,re-2fc) • 

■ f{0,n-k) 


'^n—2k 

'^n—k 

f{0,n-k) ■ 

/(0,n) 


'^n—k 

Un 
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Now we do use the particular form for the Uk’s, and assume that 6 < 1, 
since when b = 1, rji,... ,r]n are i.i.d., so its extendibility is immediate. Noting 
that one can factor out powers of Sn, a and b, write the right-hand side of 
(2.4) as 


„fc+i 


^n—2kj^2k{n—2k) 

^n—kj^k{n—k) 


^n—kj^k{n—k) 


a^b° 


^{k+l){n-k) 

Ifc+i 


^2k{n—2k) 

^k(n—k) 


^k{n—k) 

b^ 


g^{k+l){n—k) ^k{k+l){3n—2k—l)/3 


„fc+l 

i>n 


b-2k^ 

••• 6-4^ 

b-2k 

1 

b-Ak 

1 • 

00 • 

5-4 

1 

b-2k 

... 6-4 

5-2 

1 

1 

1 

1 

1 


This last determinant is of Vandermonde form, so can be computed explicitly 
as 

0<j<l<k 

which is strictly positive if 6 < 1. □ 


We lastly give the proof of Proposition 1.4. 

Proof of Proposition 1.4. Fix n > 2. Let / satisfy /(I) = a„, /(O) = 

0 and /(—I) = —bn where is positive and small and where bn is positive 
and large. 

Case 1: 5 = {(1,0), (0,1), (-1,0), (0,-1)}. 

If the model were IE, then so would be the process on {0,1}”', called 
(Yi,..., Yn), obtained by partitioning S into {(1,0), (—1,0)} and {(0,1), (0, —1)} 
and letting the first set correspond to 1 and the second set to 0. This lat¬ 
ter measure would then satisfy the FKG lattice condition which we now 
show it does not. It is easy to check that if an is sufficiently small and bn is 
sufficiently large, then for any [u 2 , ■ ■ ■ ,Un) C {( 1 , 0 ), (— 1 , 0 )}”“^, 

P{Yi = l\X2=U2,...,Xn = Un) < 1/2 

which implies 


P(yi = 11^2 = 1,...,Y„ = 1)< 1/2. 
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By symmetry we have 

P(Fi = 11^2 = 0,...,i; = 0)> 1/2, 

which then violates the FKG lattice condition. 

Case 2: S’= {-1,0,1}. 

Consider the process on {0,1}”, called (Yi,..., Yn), obtained by partition¬ 
ing S into {1,-1} and {0} and letting the first set correspond to 1 and the 
second set to 0. It is clear that 

p(yi = i|Y 2 = o,...,y„ = o) = 2/3 

for any choice of and On the other hand, it is easy to check that if 
Qn is sufficiently small and bn is sufficiently large, then 

P(Yl = 11^2 = l,...,yn = l)< 0.51 

leading to a similar contradiction as in case 1. □ 

3. Moment generating fnnctions, 3-body interactions and Isingization. 

3.1. Moment generating functions and type 1 IE. In this subsection, we 
prove Proposition 1.6. 


Proof of Proposition 1.6. For the “if” direction, let ^ be as given 
and let Y be the random variable whose distribution is absolutely continuous 
with respect to the distribution of f with Radon-Nikodym derivative given 
by 

(2 cosh x)" 
h ’ 

where h is £'[(2cosh^)"']. Note b <oo since < oo for k = 0,... ,n 

and so Y is well defined. Let W = e^/(2coshy) [which is in (0,1)]. Then 
for k = 0,... ,n, 




-gfcTg-(n-fc)y- 

( 2 coshy)"' 


^[gfcSg-(n-fc)^] cP{Ek) 
b “ b 


[Clearly the last term is then just P{Ek), concluding that c = 6 .] This shows 
fj, is extendible to an infinite exchangeable process with a mixing variable 
W a.s. contained in (0,1). 

The only if direction is more or less obtained by going backward. Choose 
a random variable W contained in (0,1) a.s. such that 

E[W'^{l-W)^-’^]=P{Ek) 

Y 

for A: = 0,..., n. Let Y be the random variable defined by VF = 2 cLhy [bere 
W G (0,1) is being used]. Let be the random variable whose distribution 
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is absolutely continuous with respect to the distribution of Y with Radon- 
Nikodym derivative given by 

1 

6(2 cosh x)"- ’ 

where 6 is -£'[ ( 2 coshy)"» ]' Since cosh is bounded away from 0 , ^ is well dehned. 
We then have for k = 0,... ,n 

^(2k-n)V n ^ E[W^{1 - W)^-^] _ P{Ek) 
6 ( 2 coshy)”' 6 6 

(In this case, 6 = 1/c.) □ 

Remark. It can be shown that in Theorem 1.1, the ^ in the above result 
is simply a normal random variable with mean h and variance J. 

3.2. Curie-Weiss Ising model with 3-body interactions. In this subsec¬ 
tion, we prove Propositions 1.7, 1.9 and 1.10. 

Proof of Proposition 1.7. Fix h, J 2 and J 3 / 0. Choose N so that 
for all n> N, the function 

/ := hx-L 

2 D 

defined at the points x = —n, —n -|-2 ,...,n — 2 ,n does not extend to a convex 
function on [—n, n]. Such an N clearly exists by looking at —n, —n + 2, —n -|- 4 
if J 3 > 0 and at n — 4, n — 2, n if J 3 < 0. Fix n> N. We claim that Hh,J 2 ,J 3 ,n 
is not IE. Choose e > 0 so that any function g defined at the points x = 
—n, —n + 2 ,..., n — 2, n satisfying |/(x) — g{x)\ < e for each such x does not 
have a convex extension to [—n,n]. If fJ-h,J 2 ,J 3 ,n is IE, let IF be a representing 
random variable. By perturbing IF a little bit, we can obtain a random 
variable IF^ taking values in (0,1) with 

(3.1) \log[n{Ek)] - log[fih,J 2 ,J 3 ,ni^k)]\ < £ ior k = 0,...,n, 

where is the probability measure on {± 1 }” coming from the mixing vari¬ 
able IF' and Ek is as in Proposition 1.6. (This uses the fact that Hh,J 2 ,J 3 ,n 
has full support.) 

Since IF' takes values in (0,1), Proposition 1.6 tells us that there is a 
random variable ^ and c > 0 satisfying 

E[e( 2 '=-)€] = cii(Efc) 

for A: = 0,... ,n. Since a moment generating function exists on an interval 
and its logarithm is convex, we conclude that the function h{t) := log(E[e*^]) 
exists and is convex on [—n,n]. Note that its restriction to x G {—n,—n + 


^[g(2fc-n)5j ^ ^ 
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2,... ,n — 2,n} differs from the function by a constant. In 

view of (3.1) and the definition of fJ-h,J 2 ,J 3 ,n, we conclude that \h{x) — f{x)\ < 
£ (after a translation of / or h) for x G {—n, —n + 2,..., n — 2, n}. This is a 
contradiction. □ 


Proof of Proposition 1.9. Since J 3 > J 2 , it is easy to check that for 
large n the function 


J2 2 I *^3 3 

f ■= —X ^- 

2n 6n^ 


dehned at the points x = —n, —n + 2,..., n — 2, n does not extend to a con¬ 
vex function on [—n,n]. Prom here, one can simply carry out the proof of 
Proposition 1.7. □ 


Proof of Proposition 1.10. Fix J2 and J 3 / 0 and denote the rele¬ 
vant measure on {±1}"' by fin (ignoring explicit notation of the dependence 
on J 2 and J 3 ). For simplicity, we can assnme that for all even n, fin is IE. 
Fix n. If fin were of type 1, there wonld exist, by Proposition 1.6, a random 
variable Xn such that 

(3.2)E[e^^"] = e(-^2/(2n2))fc2+(j3/(6n3))fc3 for fc = -n, -n + 2 ,..., n - 2, n. 

Note Proposition 1.6 only says that the left- and right-hand sides are pro¬ 
portional but by taking A: = 0, we see that equality holds (this is why we 
take n even). Since fin need not be of type 1, we need to make a preliminary 
detour to (almost) obtain (3.2). We first find a type 1 IE measure fin,m on 
{±1}"' with \\fin,m — A^nll < 1/m where || • || is total variation norm. [This 
can be easily done by taking a representing random variable W for fin and 
perturbing it a small bit obtaining a random variable W taking values in 
(0,1).] By Proposition 1.6, there is a random variable Xn,m and Cn,m > 0 
such that 


(3.3) E[e^^"’"']=Cn,nifJ-n,m{Efk+n)/ 2 ) lov k =-u,-u + 2,... ,n - 2,n, 

where E^ is as in Proposition 1.6. Setting k = 0, we see that limm->oo Cn,m = 
[fin{En/ 2 )]~^ ■ It follows that 


(3.4) 


lim E\e^^"’^] 

m—j'OO 


A^rt(-^(fc+n)/2) ^ g(J2/(2n2))fc2+(j3/(6n3))fc3 

l-'‘n{En/2) 

for k = —n, —n + 2,... ,n — 2,n. 


Since 

(3.5) lim ^ ^ 

m-^oc ^ m—j'CO ^ 
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it follows that {Xn,m}m>i IS tight. We conclude that for some mi —> oo, 
Xn,mi Xn in distribution and (3.5) allows us to conclude (by uniform 
integrability) that 

(3.6) Urn for/c =-n + 2,..., n - 2. 

m-^oo 

Now, (3.4) finally allows us to conclude that 

( 3 . 7 ) E[e^^^] = e(‘^ 2 /( 2 n 2 ))fc 2 +(j 3 /( 6 n 3 ))fc 3 for fc = -n + 2 ,..., n - 2 

which is only slightly weaker than (3.2). 

Equation (3.7) now tells us that the sequence {nX„}„>i is tight and 
hence converges along a subsequence to X^o- For z G (—1,1), one can let 
kn/n approach z (with \kn\ <n — 2) and conclude (using | 2 :| < 1 implies 
uniform integrability) that 

E[e^^^] = e(‘^2/2)^'+(‘^3/6)z3^ ^ ^ 

The two sides are complex analytic functions in {z:\ Re(z)| < 1} and hence 
agree on the imaginary axis. It follows that 

_ g(-J2/2)t2-i(J3/6)t3 

for all t G ii. We claim that there is no random variable with this characteris¬ 
tic function when J 3 7 ^ 0. If there were, let Xi and X 2 be independent copies 
with this distribution and we conclude that Xi — X 2 is normal. Theorem 
19 of [3] however says that if a sum of two independent random variables is 
normally distributed, then so is each summand. This yields a contradiction. 
□ 


3.3. Results for Isingization. In this final subsection, we prove Proposi¬ 
tions 1.12 and 1.14. 

Proof of Proposition 1.12. We break the proof up into two steps. 
We hrst prove the result under the further assumption that is of type 1 
using Proposition 1.6 and then extend the result in general. 

Fix a probability measure /r on {±1}"' which is type 1 IE and let J > 0 
and h€ R. Let be the event that exactly the first k variables are 1. By 
Proposition 1.6, there exists a random variable ^ and c so that 

for /c = 0,... ,ra. Since Tj^h{R){Ek) is proportional to 

it follows that [Tj^/i(^)](Ffc) is proportional to e^( 2 fc-n)+(j/ 2 )( 2 fc-n) 2 ^jg( 2 fc-n)$j^ 

However, the latter is clearly equal to where U is a 

standard normal random variable independent of f,. By Proposition 1.6, we 
conclude that Tj^k{g) is IE (and of type 1 although we do not need that). 
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For the second step, consider a probability measure /i on {±1}” which is 
IE (perhaps of type 2). Let VF be a representing mixing random variable. If 
P{W £ {0,1}) = 1, the result is trivial. Otherwise let m be the conditional 
distribution of W given W ^ {0,1} and let v be the probability measure 
on {±1}"" given by a mixing random variable having distribution m. After 
some reflection, one sees that is a convex combination of Tj^h{v), di 

and 5_i where 5i is the measure concentrating on having only i’s. By the 
first part, we know that is IE and so we can conclude that 

is IE as well. □ 

Proof of Proposition 1.14. Eix 2 <n <1. E{l,l) can be identified 
with probability vectors {go, ■ ■ ■ ,gi) of length I + 1 where gi is the probability 
of having i I’s. Define such a probability vector by letting go = 5, gi = 0 for 
i = 1, ... ,l — n and gi = (1 — 6) jn for i = l — n + 1, ... where d > 0 will be 
determined later. Let now /r be the distribution of the first n variables of 
the I finite exchangeable random variables corresponding to this g. It is easy 
to check that g has full support. We now claim that for h sufficiently close 
to —oo and 5 sufficiently small, To^hig) ^ E{n,l). 

Let O := {|{i :Xi = 1}| = 1}. Let r'j be the probability measure in E{n,l) 
which comes from the probability measure in E{l,l) corresponding to the g 
with gi = 1. Since 2 < n < Z, it is clear that r'i(O) < 1 for each i and so we 
can choose e > 0 so that Pi(O) < 1 — e for each i = 0,..., Z. Since the natural 
map from E{l,l) to E{n,l) is affine, we conclude that p( 0) < 1 — e for any 
i' € E{n,l). However, it is clear we can choose h sufficiently negative and 5 
sufficiently small so that TQ^h{g){0) > 1 — e implying To^hig) ^ E{n,l). □ 

Remarks. 5 is only used to yield full support of g. In the above proof, the 
V in E{l,l) whose projection was the desired g did not have full support; by 
a small perturbation, one can also take this to have full support. There is 
another explanation of why the above result is true. E{n,n) is foliated by 
differentiable curves of the form {TQ^h{g))heR as g varies over E{n,n) and 
at the same time, E{n,n) is an n-dimensional simplex. This however is not 
a contradiction since the corner points of this simplex are fixed by and 
so these points reduce to curves having only 1 point. However, the polytope 
E(n,l) has many corner points which are not fixed by and the same 
is true nearby these points. If left E{n,l) invariant, then E{n,l) would 
be foliated by regular differentiable curves near these corner points which of 
course cannot happen. 

4. A discrete moment problem and finite extendibility. In this section, 
we prove Theorems 1.15 and 1.16. 

Einding necessary and sufficient conditions for Z-extendibility for general 
n and Z seems to be out of the question, since they would necessarily be 
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very complex. We illustrate this by stating the necessary and sufficient con¬ 
ditions for small n and 1. Checking these involves routine but somewhat 
tedious computations. One simply writes down the distribution of a gen¬ 
eral exchangeable measure on {0,1}^, and solves the equations that guaran¬ 
tee that the n-dimensional marginals are the ones that correspond to the 
Curie-Weiss Ising model. Then one determines conditions guaranteeing the 
feasibility of the resulting linear programming problem. Of course by sym¬ 
metry, one can go between the cases a < 1 and a > 1 by replacing a by 1/a 
and so we just take a < 1 throughout. Here are the results: 


n = 2, Z = 3: 


b<a+a ^ 

if a < 1, 

n = 2,Z = 4: 


b ^ ^CL “h ^Qj 

if a < 1, 

n = 2,Z = 5: 

6< 

j 2a + ga”^, 

\|a + |a-i, 

if a < 1/\/3, 
if l/\/3<a< 1, 

n = 3, Z = 4: 






b<l/\J a(l — a) 

if a < 1, 

n = 3, Z = 5: 

.<{ 

■l/Va(2-3a), 

V3/^a{2-a), 

if a < 1/2, 
if l/2<a<l 


n = 4, Z = 5: 


[ {^,bi{a)]U[b 2 {a),a + a ^], if a < oq, 

\ (0, a-|-o“^], if Go < o < 1) 

where oq = 0.477... is a root of 27a® — 148a® -|- 162a‘^ — 148a^ -|- 27 = 0 and 
6i(a) < 62(a) are the two real roots of + a?b‘^ — a6® -|- 1 = 0. Note 

that in the last case, the set of 6’s for which one can extend is not even an 
interval. Note also the difference between n = 3, Z = 4 and n = 4, Z = 5 when 
a = 1; in the first case 6 can be arbitrarily large, while in the second, it 
cannot be. These are of course just special cases of Proposition 1.18. 

For a more systematic approach, one can try to imitate the alternative 
proof of Theorem 1.1. That proof involved two elements: (a) de Finetti’s 
theorem, which reduces the extendibility problem to a moment problem, 
and (b) the solution to the moment problem given by (2.2). 
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The analogue of de Finetti’s theorem for finite extendibility is elementary 
and well known. (See page 536 of [7], e.g.) Suppose that are ex¬ 
changeable {0, l}-valued random variables and set N = Then for 

k<l 


m=i. 

i=k ^ 


C) 


EN{N -k + l) 


Thus the {0, l}-valued exchangeable sequence is /-extendible if 

and only if there exists a {0,1,..., /}-valued random variable N such that 


(4.1) 


P(Xi = l,...,Xfc = l) 


EN{N -k + l) 

/(/-1) •••(/-A:+ 1) 


for all l<k<n. 

To continue this program, we would need an analogue of (2.2) for 
{0,1,...,/}-valued random variables N. We are not aware of such a re¬ 
sult. However, one can modify the approach that led to (2.2) to solve this 
problem. We will now do so, following the development in [13]. 

Fix integers 1 < n < /. After a linear change of variables, our problem 
reduces to finding necessary and sufficient conditions on numbers ,..., 
so that there exists a {0,1,... ,/}-valued random variable N satisfying 

(4.2) Vk = EN'^ for fc = l,...,n. 


The first step involves some convex analysis. Let A1 be the set of all nonnega¬ 
tive multiples of vectors (1, fi,..., Vn)-, where vi,... ,Vn satisfy (4.2) for some 
{0,1,..., /{-valued random variable N, and let V be the set of polynomials 
P{x) of degree at most n that satisfy P{i) > 0 for z = 0,..., /. Both M. and 
V are closed convex cones. Clearly P €V if and only if it has degree at most 
n and EP{N) > 0 for all {0,..., /{-valued N. Writing P{x) = J27=o 
see that P G "P if and only if X]r=o — 0 (^Oj • • • j Vn) G Ad. In other 

words, V = A4*, where A4* denotes the dual of A4. A basic result in convex 
analysis (see, e.g., Theorem 4.1 in [13]) then implies that M.=V*. This 
means that (ui,..., Vn) can arise as in (4.2) if and only if cq -|- ^ 

for all P(x) = Ya=q Cix'" G V. 

It is sufficient in the last statement to consider only P €Ve, the extreme 
points of P. To help understand the structure of Pe, we state the following 
result. 


Proposition 4.1. Suppose P G P. Then P G Pg if and only if: 

(a) P has degree exactly n, 
and 

(b) all n roots of P are simple and are contained in {0,...,/{. 
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Proof. Suppose P G Vg- Write P{x) = Pi{x)P 2 {x), where Pi has only 
real roots, and P2 has no real roots. Then P2 is never zero, and we may 
assume that P 2 {x) > 0 on [0,/]. If P 2 is not constant, we may write 

P2(x) = {P 2 ix) + ex)/2 + (P2(x) - ex)/2, 

where the two polynomials on the right have degree at most that of P 2 {x) 
and are strictly positive on [0, 1] if e is sufficiently small. Then 

P{x) = Pi{x){P 2 {x) + ex)/2 + Pi(a;)(P2(a;) - ex)/2, 

which violates the extremality of P{x). Thus all roots of P{x) are real. 

Next, write P(x) = {x — xo)Q{x), where xq is one of the roots of P(x). 
If xo ^ {0,... ,1}, then (x — xq ± e)Q{x) are both in V for small e, so that 
extremality is violated again. Therefore, all roots are in {0,...,/}. 

If P(x) has degree less than n, then the representation 

lP{x) = xP(x) + {I — x)P(x) 

shows that P is not extremal. Therefore, any extremal P has degree n. One 
can rule out multiple roots at i for i = 0,1,... ,l in a similar way, by writing 

{ x(x — 1) + X, if z = 0, 

{x — i){x — i — l)/2 + {x — i){x — i + l)/2, if 1 < i < / — 1, 

(x — l)lx — I + 1) + {I — x), il i = l. 

For the converse, suppose P G P satishes properties (a) and (b) in the 
statement of the proposition. Write P = Pi + P2 for Pi,P2 G P. Then Pi 
and P2 have the same roots as P, and therefore must be positive multiples 
of P. This completes the proof of the proposition. □ 

Note that we have now established Theorem 1.16. 

Remarks. With the above proof of Theorem 1.16, we can remove some 
of the mystery surrounding conditions (2.2) for the solution to the finite 
moment problem for a random variable W taking values in [0,1]. The proof 
in the continuous case is identical to that in the discrete case up to the 
statement of Proposition 4.1. However, in that case, the polynomials P(x) 
in Pe have n roots in [0,1], but all interior roots must have even multiplicity 
since P(x) > 0 on [0,1]. If n is even, for example, it follows that the roots 
at 0 or 1 (if any) must have multiplicities that are either both even or both 
odd. If they are both even, then P(x) = [Q{x)]'^ for some polynomial Q(x), 
while if they are both odd, P(x) = x(l — x)[Q{x)]^. Writing Q{x) = 
replacing x by W, squaring out these expressions and taking expected values, 
one sees that certain quadratic forms in the c's with coefficients that are 
Vk’s must be nonnegative definite. This (more or less) translates into the 
positivity of certain determinants, which turn out to be the ones in (2.2). 
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If n = 2, the polynomials appearing in Theorem 1.16 are just positive 
multiples of x{l — x) and {x — i){x — i — 1) for z = 0,..., ^ — 1. Therefore, the 
possibilities for (^ 1 ,^ 2 ) are 

{(ui,U 2 ) ■■V 2 < Ivi and V 2 > 4>{vi)}, 

where 


d>(x) = max [(2z + l)x — + 1)1. 

^ ' o<i<z ■' 

To see the analogy with (2.2), note that if n = 2, it becomes 

V 2 > vl and V 2 <vi, 


and that 


lim (j){yl)/f =y‘^. 

1^00 

By (4.1), we see that a necessary and sufficient for /-extendibility of the 
Curie-Weiss Ising model with n = 2 is that there be a {0,1,... ,Z}-valued 
random variables N so that 

EN _ a? + ab EN{N -1) _ 

I a‘^ + 2ab + l l{l — l) a‘^ + 2ab + l 

By the previous development, this is equivalent to 

la{b + la) / la{a + b) \ 

+ 2ab + 1 “ + 2a6 + 1/ ’ 

which is in turn equivalent to 

la{b + la)> max [(2i + l)^a(o + 6) — i(i + l)(a^ + 2a6 + 1)], 


and hence to 
(4.3) 


b < min 


jl-i) 

2i 


a + 


^ +1 -lA 

2(^-z-i)“ ;■ 


(The inequalities corresponding to z = 0 and z = / — 1 are satisfied automati¬ 
cally.) Minimizing over continuous z rather than discrete z gives the following 
sufficient (and asymptotically necessary) condition for ^-extendibility of the 
Curie-Weiss Ising model when n = 2: 


6 < 1 + 


(l + «)^ 

2a{l - 1)' 


For n = 3, the polynomials appearing in Theorem 1.16 are x{x — i){x — 
z — 1) for z = 1,...,/ — 1 and {I — x){x — i){x — i — 1) for z = 0,..., ^ — 2. 
Therefore, the set of possible values of (z;i,z; 2 ,U 3 ) is given by 


{(z;i,z; 2 ,z; 3 ) :z ;3 > (l)i{vi,V 2 ) and 0 < us < (/> 2 (ui,^ 2 )}, 
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where 


(/)i(x,y) = max[(2i + l)y - i{i + l)x] 
l<i<l 


and 


(p 2 {x,y)= min [i{i+ 1)1 - {2il+ +i+ l)x + {I+ 2i+ l)y]. 
0<i<l—l 


A necessary and sufficient condition for the Z-extendibility of the Curie- 
Weiss Ising model with n = 3 is that there exist a {0,..., Z}-valued random 
variable N so that 

+ 2a^h‘^ + ab"^ 


EN 

~r 

EN{N-1) 

1(1-1) 

EN{N -1){N -2) 


+ 30^62 + 30^2 _|_ I ’ 
+ 3a262 + 30^2 _|_ i ’ 

a3 


Z(Z — 1)(Z — 2) + 2>a?‘b‘^ + 2>ab‘^ + 1 

It then follows that this is equivalent to 

[a^ + 6^ + 2ab^)i^ — {2a^l + 2ah^l + h^ — a^)i + a{l — 1)(26^ + oZ) > 0 
for 1 < i < Z — 1 and 

(1 + 2ab‘^ + a^h^)i^ 

- {2ab^l + 2a^b^l - 1 - - 3a^b'^)i + a‘^b‘^{l - 1)(Z - 2) > 0 

for 0 < i < Z —2. The values of the first quadratic at i = 1, Z — 1 are a^(Z — 1)(Z — 2) 
and h‘^{l — 1)(Z — 2), respectively, and the values of the second quadratic at 
Z = 0, 1 — 2 are a?h‘^{l — 1)(Z — 2) and (Z — 1)(Z — 2), respectively. Minimizing 
the quadratics over continuous values of i as before, we see that a sufficient 
condition for Z-extendibility of the Curie-Weiss Ising model for n = 3 is 

min[(o -L 6)^, (1 -|- ab)'^] 


provided that 


6 < 1 + 


Z-2> 


2ab{l - 2) 

max[a^ + b'^,l + 

^ ' 


To check the assumption of Theorem 1.16 for larger values of re, we will 
need to appeal to part of the theory of quadratic forms. Recall from the 
remark following the statement of Theorem 1.16, that in the treatment of 
the continuous moment problem, one writes P{x) as a perfect square, and 
uses the equivalence between the positive definiteness of a symmetric matrix 










28 


T. M. LIGGETT, J. E. STEIF AND B. TOTH 


and positivity of the principal minors of that matrix. In the discrete moment 
problem, the roots of P are simple, so P cannot be written as a perfect 
square. Nevertheless, the “interior” roots must appear as nearest neighbor 
pairs, so that they almost have even multiplicity. To quantify the difference 
that this makes, we need a quantitative version of the equivalence of positive 
dehniteness and positivity of the principal minors. We turn to that next. 

If (7 = (cjj)jj>o is a matrix, we will use the following notation: cz\ _i = 
and for 0 < k < i,j, 


c 


k 

hj 


co,o 


Cfe-1,0 

Ci,0 


C0,fe-1 




Gjfc—1 

p,j 


Note that this is the determinant of a (fc + 1) x (A: + 1) matrix because the 
indexes of the matrix C begin at 0. 


Proposition 4.2. If all of the principal minors ^ of the symmetric 
matrix C are nonzero, then 


(4.4) 


n 

i,j=0 




n 

E 

k=0 


\Y.U 



2 



k-1 

k—l,k—l 


Remarks, (i) Note that this identity provides a simple proof of the stan¬ 
dard fact referred to earlier that a quadratic form is positive definite if and 
only if the principal minors of the matrix of coefficients are all positive. 
The only if direction can be seen by perturbing the quadratic form a small 
amount such that all the principal minors are nonzero. 

(ii) Equation (4.4) above is known as Jacobi’s formula. Jacobi’s original 
approach to it can be found in Chapter X, Section 3 of [9]— see equation 
(28) there. Nonetheless, we decided to include the proof here which seems 
to be different than the one in [9]. 


Proof of Proposition 4.2. We begin with a special case of Sylvester’s 
identity: 


(4.5) 


'^k,kH,j H,k'^k,j '-'k—l,k—l''i,j ’ 


0 < A; < i,j, 


where we have set c^~j^ = 0 if A: = min{i,j}. This special case can be found 
at the bottom of page 586 of [1]. Proofs of the general identity can be found 
in that paper, as well as in [12]. 

Using (4.5), and the observation that the first sum below telescopes, we 
can write 


c 


ij ~ 


min{2j} 

E 


k=0 



Lc 


k-l 

k—l,k—l 


.fc+l 

'i, 



E 


k=0 



k 

k,j 



k-l 

/c—1,A:—1 
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Multiplying this identity by ZiZj, summing, and then changing the order of 
summation gives 






„k „k-l 

2,j=0 k=0i^j=k ^k^k^k—l^k—1 


Zi Zj . 


Finally, use the symmetry of C to get (4.4). □ 


Now suppose P is a polynomial of degree n with n simple roots in 
{0,...,!}, and P > 0 on {0,...,!}. Then the set of roots must be of the 
form {xi, xi + 1} U • • • U {xp, Xp + 1} U A, where A = 0 or {0, Z} if n is even 
and A = {0} or {Z} if n is odd. To see this, suppose P has consecutive simple 
roots at {x, x + 1,..., x + A; — 1} and P(x — 1) > 0, P(x + Zc) > 0. The sign 
of P must change at x,..., x + A: — 1, and therefore k must be even. Now 
group these roots in pairs. For simplicity, for the moment we will take n to 
be even. Similar results hold for odd n, but many of the formulas are a bit 
different. Unlike the “continuous” case described in the remark following the 
statement of Theorem 1.16, (x — Xi)(x — Xj — 1) is not a perfect square, so 
in what follows, we will initially replace this product by (x — Xj — 

Given {xi,..., Xp}, define yo = 1 and 

yq = i-'^y Z! (Xi, + ^) ■ ■ ■ {Xi^ + ^) 

l<il<---<iq<p 

for 1 <q <p. Then we may write 


(4.6) 


n( 

i=l 


1\2 

X - X,; - o = 


n( 

.i=l 
f p 


X — Xi — 


\i=0 


= E 

*, 1=0 




Therefore, if we write 
(4.7) 


2p 


X{{x-xi-\f=Y, 


CiX 


i=l 


i=0 


it follows that for all {vq, ... ,Vn) & -M 


2p p p 

'y ( CjUj = ^ ) V2p—i—jyiyj= 'y ) Vi-\^jyp—iyp—j, 
i=0 

which is a quadratic form in the yp-iS. To apply Proposition 4.2, let V be 
the matrix whose {i,j) entry is Uj+j for i,j > 0. If the necessary principal 
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minors are nonzero, we then have 


(4.8) 


2p 

i=0 


E 


k=0 


[T.Lk'^Lyp-i] 


2 


4,k'^ 


k-1 

k-l,k-l 


This expression is used when A = 0, in which case n = 2p. Similarly, if 
A = {0,/}, we have p = {n — 2) j2 and we consider polynomials of the form 


(4.9) 


p 2p+2 

x{l - x) jj(x -Xi-\f = ^ dix\ 
i=l i=0 


which yield, provided the necessary principal minors are nonzero. 


(4.10) 


2p+2 

E 

i=0 


diVi = 

k=0 


lELk'^^tkyp-i]' 

wlk^k-lk-i 


where W is the matrix whose {i,j) entry is rcj+j+i for i,j > 0, and Wi = 
Ivi — Uj+i for i>l. 

But, of course, (4.7) and (4.9) are not the polynomials we must consider. 
So, we will use the identity 

{x — u){x — u — 1) = {x — u — \ 

to write the correct analogue of (4.7) as 

p 

]^(a; - Xi){x -Xi-l) 

(4.11) ^ 

=E(-i)'' E n 

q=0 


For the correct analogue of (4.9), multiply both sides of (4.11) by x{l — x). 

Next we will illustrate the use of these expressions to check the assump¬ 
tions of Theorem 1.16 for large /. It is this result that we will use in analyzing 
the Curie-Weiss Ising model. We will state it for even n; the case of odd n 
is similar. 


Theorem 4.3. Suppose n = 2m is even and fixed, and vo{l) = 1, vi{l ),..., 
Vn{l) > 0 for I > n are such that the corresponding quantities uf^(/) and 
wfkil) o,xe all positive and satisfy 




k-1 


Am 


w 


'tlk-i(0 


= 7kl^ + o(l^), 


= 7(,/^+2 + o(/^+2), 


k = 0,... ,m, 
k = 0,... ,m — 1, 
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^ j (pO* ^ + o(r ^), 0<k<i<n — k — 2 


{piy ^ + o(r ^), 0<k<i<n — k 


as I —> oo, where 0 < /) < 1 and 7^,7^ > 0. Then for sufficiently large I, 
vi{l),... ,Vn{l) are the first n moments of some {0,1,... ,l}-valued random 
variable. 

Remarks, (i) In view of the positivity of v^f,{l) and w^j^{l), it follows 
from criteria (2.2) that for such I there is a random variable N with values 
in [0,1] with moments vo,...,Vn. The point of the above assumptions is to 
be guaranteed that N can be taken to have values in {0,..., 

(ii) If a family of finite moment sequences is Textendible (in the sense 
of the conclusion of this theorem) for all large I, then (by passing to subse¬ 
quences) there is at least one limiting distribution of an infinite exchangeable 
sequence. Theorem 4.3 is formulated for the situation in which this limiting 
distribution is the product measure of density p, because that is the case 
that arises in our analysis of the Curie-Weiss Ising model. One could pre¬ 
sumably also use our technique to formulate analogous results for situations 
in which the limiting distribution is more general. 

(hi) By the third display above with fe = 0, we see that Vi{l) = {piy + o{l^). 
Therefore the typical summand in the determinant is of order {pl)^ 
and hence the expression in the first display above is potentially of order 
{piy^^ ■ For it to be of order l^ as is assumed here, there must be a lot 
of cancellation in the determinant. This is analogous to the fact that the 
variance of the sum of m i.i.d. random variables is of order m even though, 
without the cancellation that occurs, it would be of order m^. 

(iv) One can see from the proof that one actually only needs the third 
display to hold for i <m and the fourth display to hold for i <m — 1. 

Proof of Theorem 4.3. We will apply Theorem 1.16. Consider a 
sequence Pfix) of polynomials of the form (4.7) in which Xi = xfil) = afi + 
o{l) for i = 1,... ,p, where 0 < ctj < 1 for each i. Then the corresponding yfis 
satisfy 


yq = {-^f ai^---ai^ + o{P). 


l<i\<-"<iq<p 


Using the hypotheses of the theorem, it follows that the right-hand side of 
(4.8) equals 


V 


2 




^ (aq - P) • • • (aip.fc - p) 
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Next we must account for the fact that the polynomial that arises in Theo¬ 
rem 1.16 is of form (4.11) rather than (4.7). If Ci{l) is defined by 

m n 

Y[{x - Xi{l)){x - Xi{l) - 1) = '^Ci{l)x\ 

1=1 i=0 

then 

n 

i=0 

= (1+0(1)) 

\B\ 1 12 

X E (-ir-'^'E7.^^'^'-^ E • 

k=0 lDcB,\D\=\B\-ki£D 

We need to check that this quantity is strictly positive for large I for any 
choice of ai,..., £ [0, !]• The argument depends on how many of the a^’s 

are equal to p. For example, if 7 ^ p for all 1 < i < m, then the term above 
corresponding to i? = { 1 ,..., m} and /c = 0 is a positive multiple of and 
all other terms are of smaller order. Suppose now that ai = p and p for 
all 2 < i < m. Then the dominant term is of order and corresponds 

to i? = {1,..., m} and k = 1. More generally, suppose ai = ■ ■ ■ = aj = p and 
ai^ p for all i = j + 1,... ,m. Then the only H’s that can contribute to the 
expression satisfy \D\ <m — j, since if \D\ > m — j, one of the factors ai — p 
must be zero. Therefore, for all nonzero summands above, \B\ — k <m — j, 
and hence 

(4.12) 2|i?| — k = {\B\ — k) + \B\ < (m — j) + m < 2m — j. 

So, the largest power of I that occurs in the above expression is 2m — j. It 
can only occur if equality occurs in (4.12), that is, if |S| = m and k = j. But 
in this case, the coefficient of ^ is 7i n™ — pY-, which is strictly 

positive. 

To complete the consideration of polynomials P of the form (4.11) without 
assuming that Xi{l)/l have limits, one passes to subsequences using compact¬ 
ness. The argument for polynomials of the form 

m—1 

x{i -x) n -Xi- 1 ), 

i=l 

is similar, using the assumptions on the re’s rather than the u’s. We have 
now verified that if Pi(x) = Yl'i=Q Ci{l)x^ is a polynomial for each I > n that 
has n simple roots in { 0 , ■ ■ ■ ,1} and is nonnegative on { 0 , ... ,1}, then for 
sufficiently large I, Ci{l)vi{l) > 0. It follows from Theorem 1.16 that 
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for such I, vi{l),... ,Vn{l) are the first n moments of some {0 ,... ,Z}-valued 
random variable. □ 


We are now ready for the 

Proof of Theorem 1.15. For the Curie-Weiss Ising model, we have 

' n — k' 


P{Xi = l,...,Xk = l) = Y. 

j=k 

where uj = /Sn and 


j -k 


u 




1 < fc < re, 


Sn = J2 

j=0 

With this notation, (4.1) becomes 
EN{N - k + 1) 

(4.13) 


J ‘ 




j=k 




1 < k < n. 


Expanding the product on the left-hand side of (4.13) and writing Vk{l) = 
EN’^, we can solve the resulting equations for these quantities. The issue is 
whether vi{l), ..., Vn{l) are the first re moments of a { 0 ,..., /}-valued random 
variable. 

Defining Wk{l) = lvk{l) — Vk+i{l), we will apply Theorem 4.3 to prove 
part (a), and therefore assume at this point that re is even. To verify the 
assumptions of Theorem 4.3, and also for the easy proof of part (b), we will 
need the following asymptotic statements: As Z —> oo, 

(4.14) ~ (^n i! j |ip(l - 

(4.15) wltil) ~ ( ni!)5‘<'=+'>''"b(l - p)]('=+i)('=+2)/2;('=+i)(w)/2. 


\j=0 / 


(4.16) <,(Z)~^j(pZ)-Xfc(0 and <,(Z) ~ ^ j (,.Z)*-^<fc(Z), 

where (5 = 1 — 2cp{l — p). 

The hypothesis of part (a) of the theorem gives 5 > 0, which is what we 
need to apply Theorem 4.3. Part (b) of the theorem follows immediately 
from (4.14) with Zc = 1, since 


^^ 1,1 = 


Vo vi 

Vi V2 


= V2-Vi, 
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which is nonnegative if vi,V 2 are the first two moments of any random 
variable. 

To check (4.14), (4.15) and (4.16), we need to solve (4.13) explicitly. In 
order to do so, let G = {gi,j)o<i,j<n and its inverse H = {hij)o<i,j<n be 
defined by 


and 


G 


/ 1 \ 

I 

P 


1 \ 
I 





( ^ ^ 




1 


1 

H 


= 

P 




vv 


for every 1. These are the lower triangular matrices 


/I 

0 

••• 0\ 

0 

1 

... 0 

0 

-1 

... 0 

0 

2 

... 0 

Vo 

(-l)^-i(n-l)! 

... l] 


/I 

0 

0 

••• 0\ 

0 

1 

0 

... 0 

0 

1 

1 

... 0 

0 

1 

3 

... 0 

Vo 

1 

2^-1 _ 1 

... l) 


Entries in these matrices other than those given above are rather compli¬ 
cated. However, they are completely determined by equating coefficients of 
powers of I in the defining relations above; we will later give recursive expres¬ 
sions for them. To simplify the following expressions, we will often suppress 
the dependence on I, and write Vk = Vk{l)- We will also suppress the limits 
of the following sums, relying on the usual convention that (™) = 0 except 
when 0 < k <m. Then the solution of (4.13) is given by 


Vm 


k9k,jP 


f n — k 
\i-k 


Ui. 


(4.17) 
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To motivate the next step, recall that we are trying to prove that for large 
I, the Urn’s are the moments of a random variable N that has a distribution 
close to B{l,p). If this were the case, then E[N — IpY should be of order 
/p/2 rather than P. We will now check this without assuming that the u^’s 
are moments at all. Recalling that /9 = a/(l + a), it is natural to consider 
the following, which we rewrite using (4.17): 

m ^ ' 




n — k 
i — k 


Ui. 


Next write 6 = 1 + (c/Z) and use the binomial expansion for powers of 1 + (c/Z) 
to write 


Sn 



{l + aTi-lay-^Vm 


(4.18) 


where 


= T (,K)(i+«r(-tar’"'im,»,A 

i,j,k,m,q ^ 

^ zV-*(i + 

r,s,t:r,s>0,r-\-s-\-t<p 


P n 

C'r.,M(«) = EE«*""(l + «)" 


k=0i=k 


P \ 
k + s J 


(-i)‘ 


X ' 

\p — r — s — t 

In the final step, we have let r = k — j, s 
Shortly, we will show that 


fn-k\ , 

I ^ i^s+k,k9k,k—r- 

m — k and t = j — q + p — m. 


(4.19) Cr,s,t{a) 


0 , 


X 2*-'-^(l +a)"-P+2''+2^, 


if t > p/2, 


if t =p/2. 


But first, we will use this to complete the verification of (4.14). 
Since Sn ^ (1 + n)"" as Z —> oo, it follows from (4.19) that 


(4.20) 



if p is odd, 

r^[i-3---{p-i)][6p{i-p)ir/y 

if p is even. 
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Applying row and column operations similar to what was done in the alter¬ 
native proof of Theorem 1.1, we see that v^^{l) can be written as 


Wo 

Wl — plvo 


Vi - plvo 


V 2 — 2plvi + p^l^vo 


T, (™) 

m 

m 


Ely ■■■ E(m) 


Using (4.20), we see that 


1 

EZ 

EZ'^ • 

• EZ’^ 


■+k 

EZ 

EZ"^ 

EZ^ 

■ EZ’^+^ 



EZ^ 

EZ^+^ 

EZ^+^ . 

■ EZ'^^ 


where Z is a standard normal random variable. By Corollary 4C in [17], 
this last determinant is 1!2! ■ ■ - kl. It can also be deduced from results on the 
P = 2 Gaussian ensemble—see Chapters 4 and 17 of [18]. 

It remains to prove (4.19). First, we need some information about the 
matrices G and H. By the definition of G, for example, 


1=0 

Therefore, 

2+1 2 2+1 2 

= 9i,jV = gij-iV - i gi,jV. 

1=0 1=0 1=1 1=0 

Equating coefficients of powers of I gives gi+ij = gi,j-i — igij and then 
solving the resulting recursion leads to 

1 

gi+l,j ~ ~ ^ ' gi—j+k,k{i ~ j k). 

k=l 

Similarly, hi^ij = hij^i + jhij and 


k=l 

It follows by induction that 
(-!)'■ 

gk k-r =-X a monic polynomial in k of degree 2r, 

(‘1.21) ' 

hs+k,k = X a monic polynomial in k of degree 2s. 
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The next step is to prove by induction on j that if P is any monic poly¬ 
nomial in i of degree j, then 


(4.22) 


^ p(i)o-‘ ("_(:)= Q(m +aT-"-’ 


for some monic polynomial Q in A: of degree j. This is clearly true for j = 0. 
For the induction step, write 





= (n — k)a i' 








n — k 
i — k 


= (n - A:)aQi(A: + 1)(1 + + kQ2{k){l + 

= Q(A:)(l + a)-"-^ 

where Qi{x) and Q 2 {x) are monic polynomials of degree j — 1 (by the in¬ 
ductive hypothesis) and 

Q{k) = {n- k)aQi{k -L 1) -L A:(l -|- a)Q 2 {k) 


is a monic polynomial of degree j. 
Now start with 


j:(*;)(-i)‘i‘=(i-x)p, 


differentiate j times with respect to x and set x = \. The result is that 


E 


(_l)A3fc(A:-l).-.(fc-j + l) = 


0 ’ 

(-l)-’p!, 


if j < p, 
if j = p. 


It follows that 
(4.23) ^ 

k 




0, if J < p, 

(-l)-’p!, ifj=p. 


Since is a polynomial in i of degree 2{p — r — s — t) and leading 


coefficient (—1)^ 


—r—s—t 




—k 


/{p — r — s — t)\, (4.22) implies that 

i{n — i) \ k n — k 
p — r — s — t) \ i — k 


— _|_ g^n-k-2{p-r-s-t) 

{p — r — s — t)\ 


Q{k), 
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where Q{k) is a monic polynomial of degree 2{p — r — 




{p — r — s — t)\ 


-I- (i'jn-2{p-r-s-t) 


E 


2’'+®r!s!(p — r — s — f)! ^ 

(-1)P-*(1+ a)’"-2(P-^-^-d Jo, 


P 

k + s 


2^+^r!s!(p — r — s — f)! l(2(p —t))!, 


s — t). Therefore, 


i-l)’^xQ*ik) 

if 2{p -t)<p, 
if 2{p-t)=p. 


Here Q*{k) is a monic polynomial of degree 2{p — t). We have used (4.21) 
in the middle equality and (4.23) in the final one. 

This completes the proof of (4.14). The proof of (4.15) is similar. In par¬ 
ticular, we now know that for large I, j^{l) and w\. ^(/) are strictly positive. 
It follows from criteria (2.2) that for such I there is a random variable N 
with values in [0,/] with moments vq,. .. ,Vn- We still do not know it can be 
taken to have values in {0,and for that we still need to check (4.16). 
Note that the expression for that follows (4.20) can then be written 

as 

1 E{N -pi) • • • E{N - pl)^ 

E{N - pi) E{N - pl)^ ■ ■ ■ E{N - pl)’^+^ 

E{N-pl)^ E{N-pl)^+^ ••• E{N-pl)^^ 


Similarly, row and column operations can be used to write v^j^{l) as 


1 

E{N -pi) 

E{N-pl)’^ 

E{N-pl) 

E{N - pl)'^ 

E(N-pZ)'=+i 


E{n'-pI)^ 

E{N- pl)^'^-^ 

E{N - plf fk{pl,N) 

E{N-pl)'^+^fk{pl,N) ■■■ 

E{N-pl)'^^h{pl,N) 


where 

fk{x,y) = ^ j(j - 1) • • • (j - 

j=0 


Note that while it might appear that E(N — pl)^fk{pl,N), for example, is 
a linear combination of all the moments of N up to order i, and therefore 
would not be obtainable as a linear combination of the moments of order 
0,1,..., A; — 1 and i that originally appeared in the first column of the matrix, 
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it really is a linear combination of these latter moments. This can be seen 
by writing 


fk{x,y) 


(]k I _ j,* 

dx^-^ y — X 


To complete the proof of the first part of (4.16), note that by (4.20), 
N/l p in probability, and therefore, 


fk{pl,N) 


i—1 


^j(i-i)---(j-fc + 2) 

j=0 



in probability. It follows that asymptotically, one can factor out a term 


(()(p<r‘ 

from the last row of the above matrix. This gives the first part of (4.16). 
The proof of the last part is similar. □ 


Remarks, (i) In case (a) of Theorem 1.15, we can consider for large I a 
random variable Ni that corresponds to the extension of length I of the 
Curie-Weiss Ising model. (This is presumably not unique.) By (4.20), any 
weak limit of 

Ni-pl 

yJ5p{l- p)l 

as 1 —> oo has the same hrst n moments as a standard normal random vari¬ 
able. 

(ii) Diaconis and Freedman [5] proved that if the exchangeable measure 
p on { 0 , 1 }” is 1 -extendible, then the total variation distance between p 
and the closest mixture of homogeneous product measures is at most 4n/L 
Combining this statement with Theorem 1.15 yields the following conclusion: 
If Pi is the distribution of the Curie-Weiss Ising model on {0,1}"' with 
a = p/{l — p) and 6 = 1 -|- (c/ 1 ), where c < l/{2p{l — p)), then 

lim sup 1 inf 

Z—>oo ^ 


Pi - [ Vrlidr) 

Jo 


< 4n, 


TV 


where is the homogeneous product measure with density r, and the infi- 
mum is over probability measures 7 on [ 0 , 1 ]. 

One could try to take our analysis further at the critical point c = l/{2p{l — 
p)) (with p 7 ^: 0,1). To see what can happen, let 

1 ^ 

2 p(l — p)l 1 ^ 


(4.25) 


6=1 + 
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Theorem 4.3 no longer applies, but for small n, one can apply Theorem 1.16 
directly, as we did earlier in this section. Here are some results for n = 
2, where (4.3) is being used, that illustrate the complexity of the answer: 
Suppose p is rational, and write p= j:- Take integers m G [0, A:/2] and q G 
[0, k) so that jq = m (mod k) or jq = k — m (mod k). Then the critical value 
for ^-extendibility for all large I = q (mod k) is 

_ pjl-p) + (m/fc)^ 

2 p 2 (l -^)2 • 

Perhaps surprisingly, if n > 4 and p 7 ^ |, there is no value of d for which 
/-extendibility holds for all large 1. To see this, one can solve (4.13) for A: < 4 
and check that Ni, if it exists, must satisfy the following, as I —> 00 : 

71 — 1 

E{Ni-pl)^^{l-2p\ 


E{Ni-plf 


{•n? — 4n + 6 ) — — 5n + 6 )p(l — p) 


-2dp‘^{l-pf, 


E{Ni- plf ^ p{l- p){l-2p)l, 


E{Ni - plf ~ 2p(l - p) [n(l - 2pf + 13p(l - p) - 3]L 
If p / i, this violates the Schwarz inequality: {E{N — pl)^)"^ < E{N — pl)‘^E{N 
pl)'^. If p= one must compute higher moments to draw the same conclu¬ 
sion (for n > 6 ): 


E{Ni-plf 


2n — d 
8 


E{Ni-pl)^^y, 


E{Ni - plf ~ 


30n - 15d - 56, 
64 


Presumably, this means that the power of I that is used in the correction 
in (4.25) is not necessarily 2, and may depend on p and/or n. We have not 
investigated this further. 


5. A formula for finite extensions. In this section, we prove Proposi¬ 
tions 1.17, 1.18, 1.19 and 1.20 as well as Theorem 1.21. 


Proof of Proposition 1.17. Fix n, J > 0, h>0 and I > n. It is easy 
to check that for A; = 0,1,..., re, the probability that there are k I’s in the 
Curie-Weiss Ising model with parameters re, — J and h is given by 

— exp{—J(2A: — re)^/2 -|- h{2k — re)} 

\ ^ / J —CO 







EXCHANGEABILITY AND MEAN FIELD MODELS 


41 


Now, define for each j e {0,..., 

Q{j) := ^ 

For /i 7 ^ 0, there is no singularity and so the integral is well defined. It is not 
hard to see that Q{j) is real but our later computation will verify this. It 
is also straightforward to verify the claim concerning (1.12) for h> 0 with 
Q replaced by Q. (It is also true that the Q corresponding to I' > I has 
the Q corresponding to I as hypergeometric projection in the obvious sense. 
Although not so interesting, the above makes sense and is also correct when 
I < n.) The proof will be complete if we show that Q = Q when h> 0. The 
h = 0 case of (1.12) will then follow from continuity. 

To see that Q = Q when /i > 0, we now let M := I — k > 0 and u := 2j — I G 
{—I, —l + 2,...,l — 2,l}. If /i > 0, we have 

/ CXD 

^{ix+h)u(^^{ix+h) ^ e-(“+^))"^/j(x)dx 

-CO 

/ CXD 

+ e-2(*^+^))-^/j(a;) dx 

-CO 


— CO 
CO 

I 

— CO 


^{ix+h){u—M) 


E 

m=0 


-M 

m 


= E 

m=0 

oo 

= E 

m=0 


-M 

m 

-M 

m 




e-2™(“+'*)/j(x)dx 

) 

e-“(2"^+^-“)/j(x)dx 




= e^V(2 J) 


E 

m=0 


-M 

m 


^—Jf2{2m+M—u+hfJ)'^ 


as desired. □ 


Proof of Proposition 1.18. First, assume that n is odd and h = 0. 
In this case, we will apply Proposition 1.17 by simply verifying that the 
expression given there is nonnegative. The series we need to consider is 


ZLire- 

m=0 


(J/2)(2m+l+u)^ 


When u > 0, this is nonnegative since it is an alternating series with terms 
whose absolute values are decreasing. If u = —2j, where j = 1,2,..., write 
this series as 


2i-i 

^_-y'^m^—(J/2){2m+l+u)'^ 

m=0 



+ ■£ (-ire" 

m=2j 


(J/2){2m+l+u)^ 
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The second sum is nonnegative again because it is an alternating series with 
terms whose absolute values are decreasing. The first sum is zero, since the 
summands for m and 2j — m — 1 cancel. 

For the converse, we use our a,b parameterization. Fix n, a, b and 1. 
If the Curie-Weiss Ising model with parameters a and b on {0,1}"' is l- 
extendible, then there must exist a random variable N taking values in 
{0,..., 1} satisfying (4.13). Using this equation, we can compute the variance 
of N which then turns out to be 



n — 1 
i — 1 


n — 1 

j - 1 


^i+jl^i{n-i)+j{n-j) ^ 


It is clear that all cases fall into one of the following three cases. 


Case (1): n is even, i = n + l and a is arbitrary. In this case, the dominant 
term (as b gets large) is i = j = nj2 which can be seen to have a negative 
coefficient. Hence for large b the above is negative and the extension does 
not exist. 

Case (2): n is odd, i = n + 2 and a is arbitrary. In this case, there are 
four dominant terms (as b gets large) corresponding to i, j G {(n — l)/2, (n + 
l)/2}. An easy computation shows that in this case the sum of the coef¬ 
ficients of these four terms is negative and hence for large b the above is 
negative and the extension does not exist. 

Case (3): n is odd, i = n+l and a / 1. In this case, there are again the 
same four dominant terms (as b gets large) as in the previous case and an 
easy computation again shows that in this case the sum of the coefficients of 
these four terms is negative and so, as before, the extension does not exist. 
□ 


Proof of Proposition 1.19. Again, the series we need to consider is 


Zoire- 

m=0 


(J/2)(2m+I+u)^ 


If u > —1, then this is an alternating series with terms whose absolute values 
are decreasing and hence is nonnegative. Otherwise, write u = —2j — 1 with 
J = 1,2,3,... where there are only finitely many j’s here. Then the above 
sum becomes 


^(-l)-e 

m=0 


2J(m-i)2 


Break the sum into m < 2j -|- 1 and m > 2j + 2. The second sum is fine 
as before. For the first sum, expand the exponential in powers of J. The 
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constant term is 0 because there are an even number of summands. The 
coefficient of J in the expansion is 

2j+l 

= 2(j + l)>0. 

m=0 

This term dominates for small J and hence the sum is positive. We now 
apply Proposition 1.17. □ 

Proof of Proposition 1.20. Fix n, I and p G (0,1). The measures in 
E{1,1) are ^dimensional and correspond to a simplex Am. . Similarly, the 
measures in E{n, n) are n-dimensional and correspond to a simplex B in R^. 
The hypergeometric projection corresponds to a linear mapping / from A to 
B whose image is a set C in corresponding exactly to E{n,l). Clearly as 
a map from AtoC,f has full rank. Consider the point a in A corresponding 
to the process Y = (Fi, ... ,Yi) which is i.i.d. 0, 1 valued with P{Yi = 1) =p. 
Clearly a is an interior point of A. Since / has full rank, /(a) is an interior 
point of C. However, /(a) corresponds exactly to X = (Xi,...,X„). This 
proves the claim. 

For the last statement, fix p = 1. We take n = 4 and I = 5. Consider the 
measure on {0,1}'^ which is (1 — s)mi + em 2 where mi is product measure 
with p =1 and m 2 is uniform distribution on configurations with exactly 2 
I’s. It is easy to see that for any e > 0, this measure, while in 77(4,4), is not 
5-extendible. □ 

Remark. There is an alternative way to prove the above result. When 
one extends the product measure from n to I sites, the resulting random 
variable N satisfying (4.1) is binomially distributed. By Proposition 4.1, we 
therefore have that for every polynomial P G Ve, EP{N) > 0. Since \Ve\ < 
00 , it follows by Theorem 1.16, that if the finite sequence vq, ■ ■ ■ ,Vn is close 
to these binomial moments, then they are also the moments of some N' of 
the desired form. This, together with (4.13), completes the alternate proof. 
In fact, this proof shows that whenever we have a process {Xi,... ,Xn} in 
E{n,l) which has some “representing” N satisfying (4.1) having at least 
n -|- 1 points in its support, then small perturbations of {Xi ,..., Xn} which 
are in E{n,n) are also in E{n,l). 

We finally now move to the proof of Theorem 1.21. 

Proof of Theorem 1.21. Fix n,c,h and 1. Letting J = cll, we have, 
using (1.11), Q{j) defined for each j G {0,...,/}. We want to show that for 
h > h*{c), we have that for large I, Q{j) is nonnegative for all j G {0,... ,1}. 
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Since h*{c) > 0, in view of the proof of Proposition 1.17, we need to show 
that for j G {0,... ,1}, 


<3(i) - 


f 




where we recall that fj is the density function for a normal random variable 
with mean 0 and variance J. 

Since we do not care about positive multiplicative factors, we will use the 
notation ^ = i? if the A and B only differ by a positive multiplicative factor. 
Letting 2j — I = vl with v G [—1,1], a simple change of variables shows that 


(5.1) 



^ixv\/J ^^h+ix / \/l 


+ e-^-^^/^‘)r^-ifc{x) dx. 


Thinking of x as complex, the integrand on the right-hand side of (5.1) 
has isolated poles of order I — n at the points x = {ih + (2r -|- l)7r/2)v7 for 
r G Z and is otherwise analytic. One can then readily deduce from Cauchy’s 
theorem that if h,x> 0, then the integrand in (5.1) is unchanged if we 
integrate over M -I- i{h — x)v^ instead. This leads to 

Q{j)^ r + 

J — OO 


(5.2) 


(e 


X+ixlVl 


=-X-*T 


/Viy 


X (^gX+(W\/^)(l-'!'+(^-x)/c) 


+ e-^-(io=/VI)[i+v-[h-x)/c)yl^y^^^ 


Now, assume y = x(c, h,v) is a solution of the equation 


(5.3) 


^ — ctanh,^ = h — cv, ^ > 0. 


Denote 

p = p{c,h,v) 
and observe that 

p + q=l, 


eX 

-|- e~^ ’ 


q = q{c,h,v) 


e-^ 


p — q = tanh x and ^pq = - -^^ 

(coshy)^ 


We then readily obtain 

(5.4) Q{j) ^ r + ge-“/^)^(pe*29^/^ + ge-*2P’’’/^)-Vc(x) dx. 

J — OO 


We want to apply the dominated convergence theorem to the integral on the 
right-hand side of (5.4), with h and c kept fixed, I oo and uniformly in 
u G [-1,1]. □ 
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Choice of x- At this point, we want to understand when (5.3) has a 
solution and we treat the cases c < 1 and c > 1 separately. If c < 1, it is easy 
to check that equation (5.3) has a solution for all v G [—1,1] if and only if 
h> c and moreover the solution is then unique for all such v. 

The c > 1 case is a bit longer. Let x* be 

(5.5) x*=X*ic) :=ln(\/c+\/c- 1), 
which is equivalent to 

(5.6) (coshx*)^ = c and x* > Oj 

or to the fact that x* is the unique local minimum of — ctanh^. 

Observe that 

(5.7) /3{c) := \n{^/c + Vc — 1) + c — V— c = X* — ctanhx* + c. 

This is, of course the same as 

(5.8) /3(c) — c = min(^ — ctanh^). 

Again, (5.3) has a solution for all v G [—1,1] if and only if 

(5.9) h>P{c). 

For h > [3{c), we choose the solution of (5.3) with 

Y> A*- 

Given c and h > /3(c), we denote by x = x(c, h) the solution of the equation 
(5.3) with V = +1. Clearly, 

X*<X= min.^X 

and hence, due to (5.6) we always have 

(5.10) sup 4pgc = 4pqc < 4p*g*c = 1, 

where p,q,p* and q* all have the obvious meaning. Furthermore, keeping c 
fixed, [/3(c), oo) 3 x(c, h) G [x*(c), oo) is strictly increasing in h with 

(5.11) lim x(c,/i) = oo. 

/ l—>00 


Pointwise convergence. For fixed c and h > /3(c), the integrand on the 
right-hand side of (5.4) converges pointwise to 


(5.12) 


exp{—x^(l — 4pgc)/(2c)} 
\/27rc 


as / —> oo uniformly on compact domains of x and in the parameter v G 
[—1,1]. Due to (5.10), which holds for all c and h > f3{c), the limit function 
is integrable uniformly for v G [—1,1]. 
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Domination. For e G [0,1] we define 

(5.13) c(e):=sup|c: inf — e(siny)^) = ll. 

—oo<y<oo j 

Note that e c(e) is monotone decreasing, with 

(5.14) limc(e) = (X) and c(l) ;= lim c(e) = 0. 

£:\0 eyi 

It follows that for all c, h and v 

(5.15) inf Iey"/m4pg))(^pgi2gy ^ ^^-i2py^l ^ 

— cci<y<oo 

and this will be used in order to bound the integrand on the right-hand side 
of (5.4). 


Lemma 5.1. IFe have: 

(i) For any £ G [0,1] 

(5.16) - < He) < min{ 

(ii) Fore <2/3 

(5.17) = ? 


vr 


41n(l — e) ’ e 


Proof, (i) We obtain the first upper bound in (5.16) by looking at 
y = nj2 and we obtain the second upper bound by expanding near y = 0 
in (5.13). In order to prove the lower bound of (5.16) note that for any 
e, Q( G [0,1], 

1 — ea > (1 — e)“. 

Hence 


1 — e(siny)^ > exp{(siny)^ ln(l — e)} > exp{y^ln(l — e)}. 

(ii) In view of the first part of this lemma, we need only check that for 
e<2/3, 


g{y) = e^y\l-£sm‘^y)>l, y G M. 


To do so, note that ^(0) = 1 and compute 

g'{y) = 2£e^y\{y), 


where 


h{y) = y{l — esin^(y)) — sin y cos y. 
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Then 


h' (y) = siny((2 — e) siny — 2eycosy), 


which is zero if (i) sin y = 0 or (ii) 

(2 — e) siny 

y = ^ - 

2e cos y 

In case (i), h{y) = y, while in case (ii), 


h{y) 


siny (2 — 3e + sin^ y) y(2 — 3e + sin^ y) 
2e cos y 2 — e 


If e < 2/3, we see that h{y) and y have the same sign at each critical point of 
h. Since h{y) ^ oo as y —> oo in this case, it follows that h{y) > 0 for y > 0, 
and hence that g is increasing on [0,oo). Therefore, y(y) > 1 for all y. □ 


Now we return to the boundedness of the integrand on the right-hand 
side of (5.4). Let 

c=c{c,h):= min c{Apq) 

— 1<V< + 1 

(5.18) 

^ =c(l/(coshx(c,/i))^). 

Lemma 5.2. (i) If 

(5.19) c<c{c,h) 

holds, then the integrand on the right-hand side of (5.4) is bounded by 
(c-c)/(2cc) j ^2-j^c for all x, I and v G [—1, 1] . 

(ii) For any c, if h> h*{c), then (5.19) holds. 

Proof, (i) We clearly have 

|(ye“/^ + ge-“/^)’"| <1. 

Using c defined in (5.18) we write 

(pg*2ga:/W _|_ ^^-i2pxlVl'^-l^-x^l(2c) 

_ l^^{xlVlfl{2c)^p^i2qxlVl _|_ ^g-i2px/UZ^^-ig-a;2(c-c)/(2cc)_ 

From (5.15) and (5.18) it follows that the absolute value of this last expres¬ 
sion is bounded by e-*^(c-c)/( 2 cc)_ 

(ii) Due to (5.11) and (5.14), it is clear that for any c, (5.19) holds for 
all large h. Carrying out a tedious calculation leads to the statement for 
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c < 3/2. For the case c > 3/2, we need only observe that once h > /3(c), we 
have that 

, 11 1 1 

2/3 > — =_>_=_ 

~ c (coshx*(c))2 (coshx(c,/i))2 c(l/(coshx(c,/i))^) ’ 

the last equality following from (5.17). □ 

Conclusion of the proof of Theorem 1.21. Fix c>0 and h> 
h*{c). Lemma 5.2(11) tells us that (5.19) holds. It then follows from the 
uniform convergence in (5.12) and Lemma 5.2(1) that the integral on the 
right-hand side of (5.4) converges to 1/^1 — Apqc, as I —> oo, uniformly in 
u G [—1,1]. Since 1/-^/1 — Apqc is clearly bounded away from 0 uniformly in 
V, it follows that this integral is positive for all large 1. □ 
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